AITopics | Middletown

Collaborating Authors

Middletown

When is dataset cartography ineffective? Using training dynamics does not improve robustness against Adversarial SQuAD

arXiv.org Artificial IntelligenceMar-23-2025

In this paper, I investigate the effectiveness of dataset cartography for extractive question answering on the SQuAD dataset. I begin by analyzing annotation artifacts in SQuAD and evaluate the impact of two adversarial datasets, AddSent and AddOneSent, on an ELECTRA-small model. Using training dynamics, I partition SQuAD into easy-to-learn, ambiguous, and hard-to-learn subsets. I then compare the performance of models trained on these subsets to those trained on randomly selected samples of equal size. Results show that training on cartography-based subsets does not improve generalization to the SQuAD validation set or the AddSent adversarial set. While the hard-to-learn subset yields a slightly higher F1 score on the AddOneSent dataset, the overall gains are limited. These findings suggest that dataset cartography provides little benefit for adversarial robustness in SQuAD-style QA tasks. I conclude by comparing these results to prior findings on SNLI and discuss possible reasons for the observed differences.

dataset cartography, machine learning, question answering, (17 more...)

arXiv.org Artificial Intelligence

2503.1829

Country:

North America > United States > Texas > Travis County > Austin (0.28)
North America > United States > Colorado (0.05)
Europe > Italy > Tuscany > Florence (0.05)
(9 more...)

Genre: Research Report > New Finding (0.87)

Industry: Leisure & Entertainment > Sports > Football (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.37)

Add feedback

Vertical Federated Image Segmentation

Mandal, Paul K., Leo, Cole

arXiv.org Artificial IntelligenceJan-15-2024

With the popularization of AI solutions for image based problems, there has been a growing concern for both data privacy and acquisition. In a large number of cases, information is located on separate data silos and it can be difficult for a developer to consolidate all of it in a fashion that is appropriate for machine learning model development. Alongside this, a portion of these localized data regions may not have access to a labelled ground truth. This indicates that they have the capacity to reach conclusions numerically, but are not able to assign classifications amid a lack of pertinent information. Such a determination is often negligible, especially when attempting to develop image based solutions that often necessitate this capability. With this being the case, we propose an innovative vertical federated learning (VFL) model architecture that can operate under this common set of conditions. This is the first (and currently the only) implementation of a system that can work under the constraints of a VFL environment and perform image segmentation while maintaining nominal accuracies. We achieved this by utilizing an FCN that boasts the ability to operate on federates that lack labelled data and privately share the respective weights with a central server, that of which hosts the necessary features for classification. Tests were conducted on the CamVid dataset in order to determine the impact of heavy feature compression required for the transfer of information between federates, as well as to reach nominal conclusions about the overall performance metrics when working under such constraints.

bottom model, federated learning, information, (12 more...)

arXiv.org Artificial Intelligence

2401.07931

Country:

North America > United States > Texas > Travis County > Austin (0.14)
North America > United States > Delaware > New Castle County > Middletown (0.04)

Genre:

Research Report (0.64)
Overview (0.48)

Industry: Information Technology > Security & Privacy (0.86)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Unification of popular artificial neural network activation functions

Mostafanejad, Mohammad

arXiv.org Artificial IntelligenceJul-25-2023

We present a unified representation of the most popular neural network activation functions. Adopting Mittag-Leffler functions of fractional calculus, we propose a flexible and compact functional form that is able to interpolate between various activation functions and mitigate common problems in training neural networks such as vanishing and exploding gradients. The presented gated representation extends the scope of fixed-shape activation functions to their adaptive counterparts whose shape can be learnt from the training data. The derivatives of the proposed functional form can also be expressed in terms of Mittag-Leffler functions making it a suitable candidate for gradient-based backpropagation algorithms. By training multiple neural networks of different complexities on various datasets with different sizes, we demonstrate that adopting a unified gated representation of activation functions offers a promising and affordable alternative to individual built-in implementations of activation functions in conventional machine learning frameworks.

activation function, artificial intelligence, machine learning, (13 more...)

arXiv.org Artificial Intelligence

2302.11007

Country:

North America > United States > Virginia > Montgomery County > Blacksburg (0.04)
North America > Canada > Ontario > Toronto (0.04)
Europe > Russia (0.04)
(13 more...)

Genre:

Instructional Material (0.69)
Research Report (0.64)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback